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Abstract 

Dream I1, a popular fantasy sports platform, gives cricket enthusiasts a chance to 
select a dream team of players and earn rewards. The proposed paper presents a 
system that would assist users in preparing dream teams for cricket matches. The 
proposed system introduced a model designed to help the user in selecting ideal 
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Rerwores: os teams by applying machine learning techniques. The machine learning model 
Chatbot, Decision ‘ : oe : 

: combined cricket data, head-to-head player statistics, and venue-wise data, and 
Making, Fantasy : : : 
Co Cradiont then assigned scores to all players from both teams. The system mainly assigned 

pars ah higher weights to the players from the predicted winning team but also considered 
Boosting, Predictive : E 
Model the performance of key players from the opposing team. The chatbot interface 


allowed continuous interaction with the user to allow them to gain information 
about cricket. The system provided an evaluation of the methodology used, and the 
use of the predictive model, which resulted in displaying player scores for decision- 
making. 


1. Introduction 


Cricket is an engaging sport all over the world. 
Dream11, a fantasy sports platform developed that 
allowed cricket enthusiasts to create dream teams 
and win different rewards. The proposed system 
addressed the audience by introducing a solution 
that combined prediction skills with a chatbot 
interface. The necessity for the proposed system 
arrived from the rising interest in cricket 
predictions, and the growing demand for data- 
related decision-making. As of the latest update, no 
system offered a combination of Dream11 
prediction and chatbot functionalities. The idea of 
combining prediction with a chatbot interface gave 
a new approach to the domain. The technologies for 
prediction till now mainly depended on player 
information which was not sufficient for accurate 
predictions. To increase the accuracy of prediction 
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features such as head-to-head player data, venue- 
wise player performance, pitch conditions, and 
weather data needed to be included. This helped to 
provide the users with more accurate player 
recommendations for fantasy teams. The paper by 
Nilesh M. Patil et al. (2020) [6] introduced a cricket 
team winning prediction model, that used the 
Random Forest Algorithm and Decision Tree 
classifiers to predict the optimal playing eleven 
based on the player's previous performance. It 
integrated batting, bowling, and overall abilities to 
remove bias in player selection, which offered an 
understanding of individual and team performance 
using virtual simulations. Ayush Tripathi et al. 
(2020) [2] used feature engineering, that mainly 
focused on the Indian Premier League (IPL). By 
using machine learning techniques like neural 
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networks and support vector machines, it assisted in 
the prediction of match outcomes, and bowler 
performance evaluation. Saurav Singla et al. (2020) 
[13] used a Memetic Genetic Algorithm for player 
selection. It used an integer optimization method 
based on the player’s previous ten matches’ 
performance and suggested enhancements that 
included head-to-head data and venue-specific 
performance. Rohit Tamrakar et al. (2021) [10] 
paper introduced a CHATBOT. This chatbot used 
Artificial Intelligence and machine learning and 
provided practical examples and further, helped by 
proposing how it would be used in CAD 
applications. Sachin Kumar S et al. (2022) [12] 
developed Fantasy Sports Platforms, which focused 
on data science and analytics for decision-making, 
and combined Greedy and Knapsack algorithms for 
team selection and utilizing PyCaret and Plotly 
Python libraries for prediction and visualization. 
Based on the previous work [14], analysis of 
venues, team fitness, and other factors such as 
weather and pitch conditions significantly 
improved prediction accuracy. This was 
accompanied by considering various classifiers and 
machine learning algorithms, with a detailed search 
for optimal hyperparameters. This research paper 
covered the proposed implementation plan, by 
examining various techniques used in Dream11 
prediction. It introduced a model that focused on 
overcoming the limitations that were there in the 
existing approaches. The proposed system aimed to 
provide an efficient prediction model that integrated 
with the chatbot interface. The paper explored the 
entire project lifecycle, from conceptualization and 
methodology to implementation and results. 
2. Methodology 

2.1 Architecture 
Figure 1 shows the System Architecture, which 
consists of a user interface that allows the users to 
connect with the system. Users could gain access to 
information related to cricket as well as fantasy 
cricket prediction and engage with the unified 
chatbot. The backend service included a prediction 
model that exercised algorithms of machine 
learning for the prediction of the Dream 11 team. 
The Database Layer stored historical cricket data, 
head-to-head player statistics, and player 
performance at each venue, which provided the 
information needed by the prediction model and 
chatbot response. 
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ataset 


Figure 1 Architecture of The System 


2.2 Dataset Description 
For the proposed system, three main datasets were 
used to increase the precision and effectiveness. 
The main dataset used included cricket data which 
contained a record of all IPL matches that were 
played up to date [15]. This dataset helped to find 
the winner, toss outcome, and key performers of 
each match. The next dataset contained a head-to- 
head analysis [16] of each player against each other. 
This consisted of how each batsman played against 
each bowler. The last dataset included the player's 
performance at different venues [17]. This included 
batsman performance as_ well as_ bowler 
performance. 
2.3 Dataset Processing 
2.3.1 Head-to-Head Data 
A formula was used to calculate a performance 
index for each player against every bowler. The 
formula was expressed as follows: 
Per formanceindex 
= (Weights['Runs'] 
x data['Runs’] 
+ Weights|'Balls'| 
x data['Balls’| 
+ Weights|['Wickets’] 
x data['Wickets'] 
+ Weights|'StrikeRate’ 
x data|'StrikeRate’]) 


Following the calculation of the performance index, 
normalization was performed to scale the values 
between the range [0, 1]: 


Oo 
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Normalized Perf Index 
_ Performance_index—min _perf_index 


x 20 


max_perf_index — min_perf_index 


Later, the normalized performance index was mapped 
to distinct scores within the range of 0 to 10: 


Batsman_Score=round(Normalized_(Perf Index )) 


This approach ensured a meaningful and standardized 
evaluation of each player's batting performance, which 
helped the accuracy and relevance of the predictive 
algorithms integrated into the cricket prediction system. 
2.3.2 Venue-Wise Player Performance 
Data 
At each venue, the batsmen and bowlers perform 
differently. Probability Score (PS) was used to calculate 
the performance of players. For batting performance, it 
combined batting average and strike rate at each venue 
where the match was played. The PS for a batsman Bi at 
venue V was calculated using the following formula: 


PSpi=(2 X Avg'si,v) + (3 X SR‘si,v) 


Where Avg*si, v and SR"; v represented the batting 
average and strike rate of batsman Bj at venue V, 
respectively. Similar to batting performance, bowling 
performance was also assessed. The factors that were 
taken into consideration were the number of wickets 
taken and the economy rate of the bowlers at each 
venue. The PS for a bowler Bj at venue V was 
computed as: 


PSB = (3 X Wkts*3v) + (2 X (1 - ER*5,v)) 


Here, Wkts*z; vand ER’x; v denoted the normalized 
wickets taken and the economy rate of bowler Bj at 
venue V, respectively. 


2.4 Score Calculation 


Learning 

Based on the scores calculated and the machine 
learning model, the proposed system then used a 
technique for calculating the scores of the players. 
Initially, using the machine learning algorithm, 
winning and losing teams were determined and 
initial scores were assigned to all the players of the 
teams. Higher scores were assigned to players 
belonging to the winning team. The system then 
analyzed the performance of key players of both 
teams from the last two matches and updated the 
scores. Further, the scores calculated from venue- 
wise data and head-to-head data were added to 
adiust the scores. The scores were then combined 
and normalized and a final score was calculated for 
each player. 


Using Machine 
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2.5 Chatbot Model 
With the help of Botpress [5], a chatbot was 
developed. This allowed users to interact with the 
bot and find cricket-related information. The 
information provided included match statistics, 
player information, historical data, or any other 
information needed. By concentrating on cricket, 
the users are ensured that they receive the 
information they want. Using Botpress, users are 
guaranteed a _ streamlined experience, easy 
accessibility, and engagement. 
3. Implementation 

3.1 Support Vector Machine 
The Results should include the rationale or design 
of the experiments as well as the results of the 
experiments. Results can be presented in figures, 
tables (1), and text. 

3.2 AdaBoost Classifier 
The AdaBoost Classifier [11] trained weak 
classifiers on match data iteratively and then 
adjusted the focus on difficult ones. It was used to 
predict the match winners based on features like 
home team, away team, and venue as it combined 
weak learners into strong learners to improve 
accuracy. 

3.3 Logistic Regression 
Logistic Regression (LR) [3-8] tried to fit a logistic 
curve to the input data. which consisted of features 
such as home team. awav team. and venue. This 
helped to predict winners bv predicting the 
probability of the winning team with accuracy. 

3.4 Random Forest Classifier 
The Random Forest Classifier [1-2] combined 
multivle decision trees to predict the winning team. 
The final prediction of this classifier is calculated 
bv combining predictions of individual trees to 
improve accuracy. remove overfitting, and make 
reliable choices for match winner. 

3.5 Gradient Boosting Classifier 
The Gradient Boosting Classifier [7]. built a 
sequence of decision trees using features such as 
home team awav team. and toss outcome during 
training to predict the winning team. This classifier 
helped in achieving accuracy which handled 
complex interactions between features. 
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Figure 2 Evaluation of Different Models 
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As shown in figure 2. the graph shows the 
evaluation metrics for different machine learning 
models implemented. 
4. Result and Analysis 
Different machine learning algorithms were used to 
train the model and provide a set of outcomes. 
Evaluation metrics including Accuracy. Precision. 
Recall. and Fl-score were used to measure the 
performance of the machine learning algorithms 
[9]. The formulas for the evaluation metrics are as 
follows: 

dpe 

TP +TN +FP+FN 


True Positive 


Precision = = 
True Positive + False Positive 


True Positive 


Recall = ———___—______—_—__ 
—_ True Positive + False Negative 


2 X Precision X Recall 
F1 Score = ———————_- 
Precision + Recall 


The following data shows the results that were 
obtained by emploving the different machine- 
learning algorithms for model training: 


Table 1 Result Analysis 
Algorithm | Accuracy | Precision | Recall | F1 


Score 
Support 13.33 3.13 6.63 3.51 
Vector 
Machine 
AdaBoost | 17.77 3.20 9.12 4.51 
Classifier 
Logistic 20.55 9.83 13.11 | 11.12 
Regression 
Random 61.67 55.77 44.32 | 45.16 
Forest 
Gradient 73.89 69.18 70.26 | 69.17 
Boosting 


Based on the results observed, the Gradient 
Boosting Classifier was found to give the most 
accurate prediction for the proposed model with an 
accuracy of 73.89%. By including the Gradient 
Boosting Classifier in the model, based on the input 
given by the user the winning team was predicted. 
Based on the predictions, further score calculations 
of players were done and then displayed back to the 
users. 


Conclusion 

The proposed system showcased advancement in 
Dream11 prediction models. The system integrated 
the Dream11 prediction model with a chatbot 
interface that overcame the limitations that were 
there in the existing platforms by increasing the 
accuracy of prediction. With the help of a user- 
friendly interface, it enabled users to obtain the 
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fantasy team. The fusion of the machine learning 
model and chatbot in the system helped to enhance 
user experience and engagement. With the new 
features added, the probability of obtaining an 
accurate team increased and helped the users gain 
rewards on the fantasy application. 
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